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Abstract —Object detection is an important task in com¬ 
puter vision and learning systems. Multistage particle win¬ 
dows (MPW), proposed by Gualdi et aL, is an algorithm 
of fast and accurate object detection. By sampling particle 
windows from a proposal distribution (PD), MPW avoids 
exhaustively scanning the image. Despite its success, it is 
unknown how to determine the number of stages and the 
number of particle windows in each stage. Moreover, it has 
to generate too many particle windows in the initialization 
step and it redraws unnecessary too many particle windows 
around object-like regions. In this paper, we attempt to 
solve the problems of MPW. An important fact we used is 
that there is a large probability for a randomly generated 
particle window not to contain the object because the object 
is a sparse event relevant to the huge number of candidate 
windows. Therefore, we design the proposal distribution so as 
to efficiently reject the huge number of non-object windows. 
Specifically, we propose the concepts of rejection, acceptance, 
and ambiguity windows and regions. This contrasts to MPW 
which utilizes only on region of support. The PD of MPW 
is acceptance-oriented whereas the PD of our method (called 
iPW) is rejection-oriented. Experimental results on human 
and face detection demonstrate the efficiency and effectiveness 
of the iPW algorithm. The source code is publicly accessible. 

Index Terms —Object detection, particle windows, random 
sampling, feature extraction. 

1. Introduction 

O BJECT detection is a key component of many com¬ 
puter vision systems 12^ . E). Generally, object 
detection consists of two steps: feature extraction and 
classification m, Ea, a, ca. In this paper, we di¬ 
vide the task of object detection into three steps: window 
generation, feature extraction, and classification. Window 
generation outputs windows determined by shape, location 
and size. Features are extracted from the windows and 
then are classified by a classifier. Suppose N windows are 
generated and the time spent in generating the windows is 
tw Let f/ be the time of extracting a feature vector from 
a window (i.e., subimage) and tc be the time of classifying 
the feature vector as either positive or negative class. Then 
the computation time t of an object detection algorithm can 
be expressed as: 

t = “h N X tf “h N X tfj. (1) 
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Usually, tw is very small and can be neglected. Obvi¬ 
ously, it is desirable if N is as small as possible on the 
condition that the detection rate and false positive rate are 
acceptable, which is the goal of our algorithm. 

A dominant manner of window generation is sliding 
window (i.e., SW) based scanning. Given the pixel stride 
and scale factor, windows are determinately generated from 
top to bottom, left to right, and small to large. This deter¬ 
ministic manner requires a very large number of windows 
in order to detect objects with high detection rate. 

Windows can also be generated in a stochastic (random) 
manner. The stochastic manner can also be categorized into 
active sampling Recently, Gualdi et al. na proposed 
to generate the windows (called particle windows) by sam¬ 
pling from a probability density function which is called 
proposal distribution (PD). This algorithm is called multi¬ 
stage particle window with abbreviation MPW. The initial 
PD is uniform distribution, meaning that each candidate 
window has the same chance to contain the object. If 
some windows of the sampled particle windows (called 
particle windows in ifTSl ) have large classifier responses, 
then the PD is updated by enhancing the positions nearby 
these windows. Consequently, when sampling from the 
updated PD, more windows will be generated near the 
previous particle windows. Therefore, a smaller number 
Ni{Ni < Ni) of particle windows is needed to be drawn 
from the updated PD, which is why the MPW method can 
use smaller number of windows to get the same detection 
rate and false positive rate as the SW method. 

Despite the great success of MPW, there is still room to 
improve it. Due to the non-negativity of the weights and 
density, almost all particle windows are sampled from the 
regions neighboring to the particle windows obtained in the 
previous stages. Therefore, if the number of initial particle 
windows is not large enough to contain true positives, 
then there is a very large probability that MPW will 
not sample the positive windows any more. In addition, 
MPW generates unnecessary too many particle windows 
around the object and object-like regions. Considering that 
classification of these regions is more time-consuming than 
obvious non-object regions in the algorithm of cascade 
AdaBoost, so generating too many particle windows around 
the object and object-like regions greatly limits its effi¬ 
ciency. Importantly, it is unknown to use how many particle 
windows in each stage. 

In this paper, we propose to improve MPW with the aim 
of sampling a smaller number of particle windows without 
any loss in detection rate and false positive rate. In addition. 
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we solve the problem of determining the number of particle 
windows in each stage. We call it iPW. Compared to MPW, 
iPW has the following characteristics and advantages. 

1) iPW does not need to generate a large number of 
particle windows at the initial iteration (stage), which 
greatly reduces the detection time. Even if the initial 
particle windows do not contain any object, iPW can 
detect the objects at next stages. 

2) MPW draws unnecessary too many particle windows 
around the positive windows whereas iPW avoids 
generating the redundant particle windows by using 
the information of both rejected and accepted particle 
windows. 

3) To use MPW, one has to empirically set the number 
of particle windows in each stage whereas this is not 
a problem because iPW generates a single particle 
window in each iteration (stage). 

4) iPW fully makes use of the information of rejected 
negative particle windows while MPW almost com¬ 
pletely depends on the accepted positive particle win¬ 
dows. Rejected negative particle windows are used to 
directly suppress the PD around these windows and 
at the same time indirectly enhance the PD beyond 
the windows. In this sense, iPW is rejection-oriented 
while MPW is acceptance-oriented. 

5) In MPW, the uniform distribution is used in the 
initialization stage and plays an unimportant role in 
the later stages so that it can be omitted. In iPW, 
a dented uniform distribution is used to play an 
important role for sampling useful particle windows 
by rejecting impossible regions. 

6) iPW utilizes dented Gaussian distribution while 
MPW utilizes full Gaussian distribution for sampling 
particle windows. By using dented Gaussian distribu¬ 
tion, iPW avoids drawing many unnecessary particle 
windows around the object and object-like regions. 

7) To obtain the same detection accuracy, the total 
number of particle windows in iPW is much smaller 
than that in MPW. 

The remainder of the paper is organized as follows: In 
Section 2, related work is discussed. Section 3 reviews the 
MPW algorithm. The proposed iPW algorithm is described 
in Section 4. Experimental results are given in Section 5 
before summarizing and concluding in Section 6. 

II. Related Work 

According to (1), the computation time of an object de¬ 
tection algorithm is determined by window generation, fea¬ 
ture extraction, classification, and the number of windows. 
Accordingly, we can categorize existing efficient object 
detection algorithms into feature-reduced, classification- 
reduced, and window-reduced types. In addition, combi¬ 
nation of the different types of algorithm should also be 
considered. 

Cascade AdaBoost plus Haar-like features can be 
viewed as classical combination of feature-reduced and 
classification-reduced method ED, lEo). In contrast, neural 


network based face detection ED is time-consuming in 
feature extraction and classification though it has compa¬ 
rable detection accuracy. Similarly, Histogram of Oriented 
Gradients (HOG) plus Support Vector Machine (SVM) lIT^ 
can be improved in efficiency by using integral image 
and cascade structure. The trilinear interpolation of HOG 
can also be approximated by decomposing gradients into 
different angle planes followed by simple smoothing 1^ . 
There are many efficient object detection algorithms using 
the technique of integral image for extracting simple but 
rich features. One important type of features for human 
detection is integral channel features a, 0. 

Designing optimal cascade structure is also an important 
topic for increasing the speed of object detection. Recent 
methods include crosstalk cascade CD, CoBE 0, LAC- 
Boost Eg), ES, sparse decision DAGS {directed acyclic 
graphs) f7l|, etc. 

In addition to the integral-image based algorithm, coarse- 
to-fine feature hierarchy uni, 1461 and template matching 
with binary representation CD are also feature-reduced 
methods. The coarse-to-fine feature hierarchy is able to 
reject the majority of negative windows by the lower reso¬ 
lution features and process a small number of windows by 
higher resolution features 1461 . By deep analysis of statistic 
of multiscale features. Dollar et al. cni developed an 
efficient scheme for computing feature pyramids. Liu et al. 
El developed a probability-based pedestrian mask which 
can be used as pre-filter to filter out many non-pedestrian 
regions. Template matching with binary representation for 
gradient information is a promising method for detecting 
textureless objects in real time ca. Owing to its ele¬ 
gant feature representation and the architecture of modern 
computers, template matching with binary representation 
for gradient information can use thousands of arbitrarily 
sized and shaped templates for object detection in very fast 
speed ESI. By setting proper sliding stride, features can 
be reused to avoid computing the features in a window 
overlapping with its neighbors (321. The information of 
spatial overlap can also be used for image matching and 
recognition (T]. Deep learning with rich features hierarchy 
is also a promising direction (T^ . 

Classifier-reduced method arrives at high efficiency by 
designing efficient classifier. In addition to cascade Ad¬ 
aBoost which uses a few of classifier to reject a lot of win¬ 
dows, one can design efficient linear and nonlinear SVM to 
classification. Vedaldi and Zisserman proposed to use ex¬ 
plicit, instead of implicit, feature maps to approximate non¬ 
linear SVM (401 . Mak and Kung developed a low-power 
SVM classifier where the scoring function of polynomial 
SVMs can be written in a matrix-vector-multiplication form 
so that the resulting complexity becomes independent of the 
number of support vectors ED, ESI. Linear SVM classifies 
a feature vector by computing the inner product between 
the sum of weighted support vectors and the feature vector. 
To reduce the computation complexity of the inner-product 
based classification. Pang et al. proposed a sparse inner- 
product algorithm (33l . The idea is that neighboring sub¬ 
images are also neighboring to each other in the feature 
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space and have similar classifier responses. Pang et al. also 
developed a distributed strategy for computing the classifier 
response 1261 . 

Window-reduced method is a promising direction devel¬ 
oped in recent five years. This kind of methods aim at 
reducing the number of windows where feature extraction 
and classification have to be conducted. When an image 
is represented by a small number of keypoints and their 
descriptors (i.e., visual words), branch&bound (a.k.a., ef¬ 
ficient sub window wearch (ESS)) is very efficient because 
it hierarchically splits the parameter spaces into disjoint 
spaces and uses quality functions to reject large parts of the 
parameter space 1^ . Branch&bound can also be used in 
implicit shape model which adopts hough-style voting 1221 . 
1231 . Branch&rank generalizes the idea of branch&bound 
by learning a ranking function that prioritises hypothesis 
sets that do contain an object over those that do not 1^ . 
Acting testing is also a promising method for rapid object 
detection 0, El. But these visual-word based methods 
are not suitable for detecting textureless objects because it 
is not reliable to detect keypoints from the objects. 

As a signal can be reconstructed from irregularly sampled 
points El, an object can be detected by random sampling a 
fraction of all the possible locations and scales. Multi-stage 
particle window (MPW) is a window-reduced object de¬ 
tection method ca using random sampling. Branch&rank 
and branch&bound are based on keypoint detection whereas 
MPW extracts Haar-like features, HOG, or other features as 
the same manner of sliding window based object detection 
method. Therefore, MPW is expected to be suitable for 
detecting both texture-rich and texture-less objects. Sliding 
window based method investigates all the windows overlap- 
pingly and uniformly spaced in spatial and scale domains. 
In contrast, MPW only checks the windows generated from 
an updating proposal distribution. As iteration proceeds, 
the main peaks of the distribution evolve towards the 
objects. The open problem in MPW is how many windows 
should be generated in each iteration (stage). Existing MPW 
uses empirical numbers which is hard to result in optimal 
solution. In this paper, we propose a novel particle window 
based object detection method that is more efficient than 
MPW and avoids choosing particle numbers in multiple 
stages. 

It is noted that the technique of Detection Proposals (DP) 
||48]| (sometimes called Objectness O or Selective Search 
(391) also generates a number of windows by sampling 
from all the possible windows. But the number of generated 
windows has to be large (e.g., 103 or 104) enough if 
acceptable detection quality is required. Moreover, the time 
spend on generating detection proposals is not satisfying 
(see Table 2 of El)- Nevertheless, the DP methods have 
been successfully employed in deep leaning based detection 

(SI. 


III. Multi-stage Particle Windows 

The algorithm of multi-stage particle window (MPW) 
El is the basis of our method. 


A. Algorithm 

MPW investigates a fraction of all candidate windows in 
an image by sampling from a proposal distribution. Each 
particle window represents a window w = (x, 5 )^ where 

X, y and s are the horizontal position, the vertical position 
and the size of the window, respectively. The window can 
also be expressed as w(x,^, s). Once a window w is 
generated, the feature vector is then extracted and classified. 
Let /(v^) be the classifier response. The main issue of 
MPW is how to design the proposal distribution. 

At the beginning of the MPW algorithm, no prior 
knowledge is known about the preference on the candidate 
windows. So the proposal distribution go(w) = qo{x,y,s) 
is modeled as a uniform distribution u{w) = u{x, y, s) = 
l/N, where N is number of all possible windows in the 
image. 

In the first iteration, particle windows are drawn from 
the uniform proposal distribution go(w) = u{'w) (see Pig. 
1(a)). The classifier response /(w^) is normalized by 


/(Wi) ^ 


/(Wj) 


( 2 ) 


so that — 1, 0 — — 1- Then the 

proposal distribution is updated according to the classifier 
responses /(w) of the Ni particle windows: 


E i V 1 

^.^i/(wj)G(wj,S). (3) 

In ([^, G(wj,E) is a Gaussian distribution where the 
mean is centered at Wj and E is the covariance of Gaussian 
distribution. The weight ai balances the previous proposal 
distribution qq and the mixture of Gaussian distributions. 
Gualdi et al. El found that ai = 1 is almost the best 
choice, meaning the unimportance of previous proposal 
distribution. In Section 3.2 we will explain why = 1 
is a reasonable choice. 

The sum term in is called measurement density 
function pi (w) (T5]| : 


fi(w) = y] , /(wj)G'(wj, S). (4) 

If 0(1 = 1, then the proposal distribution gi(w) is 
identical to the measurement density function pi(w). 

In the second iteration, N 2 particle windows are drawn 
from gi(v^). Usually, N 2 is smaller than A^i. Because the 
classifier response /(w) is large in the regions nearby the 
positives, most of the N 2 particle windows lie around the 
positives (see Pig. 1(b)). 

The iteration continues with the new proposal distribu¬ 
tion in stage i\ 


9i(w)=(l-ai)ft-i(w)+ai 2^ .^^/(wj)G(wj,Si). (5) 


If ai = 1, the proposal distribution becomes: 


®(w) = Pi{w) = gi{w) = 2^ /(wj)G'(wj, Si), (6) 


which is in fact the employed proposal distribution in the 
experiments in El 

Algorithm 1 shows the procedure of MPW. 
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(a) (b) 


Fig. 1. The process of MPW . [^ Stage 1 samples initial particle windows 
by uniform distribution. |(b)| Stage 2 generates particle windows around 
pedestrian. 


Algorithm 1 The algorithm of MPW. 

Input: 

Stage number S; 

The number Ni of particle windows in stage i, i = 

The number N of all candidate windows. 

Output: 

The set Wp of positive particle windows. 

1 : Initialization 

2: Empty the set of positive particle windows: Wp ^ 

3: Initialize the proposal distribution g{w) by uniform 
distribution u{w) = 1/N, i.e., g{w) ^ 1/N. 

4: for s = 1 to aS do 

5: Sample Ni particle windows from g{'w). 

6: Put the Ni particle windows into W, i.e., 

W = {avi, ... ,AVArJ. 

7: If f{wj) = 1, j=l, ...,Ni, then Wp = Wp U wj. 

8: Update g{'w) using the W, and empty W. 

9: end for 
10: return Wp. 


B. Why ai = 1 

In this section, we explain why = 1 is a reasonable 
choice. To the best of our knowledge, we are the first to 
explain why ai = 1. 

For the sake of simplicity, we assume that the scale is 
fixed and there is only one object in the image. For an 
h X w image, the number of candidate windows is M = 
h X w. Fet the size of support region be m ^ M. In the 
initialization step, Ni particle windows are drawn from the 
uniform distribution go(w) = u{w). Then the probability 
p that the Ni particle windows contain the object is p = 
1 — (1 — m/M )^^. Suppose that M = 640 x 480, m = 50, 
and Ni = 1000, then p = 0.15. Obviously, if A^i is small, 
it is a small probability that particle windows contain the 
object. 

In the later stage, MPW generates a smaller number 
of particle windows, where (1 — ai) fraction is from the 
uniform distribution. The probability that the (1 — ai) 
fraction of particle windows contains the object is much 
smaller. So ai is usually set 1. 


TABLE I 

Representative values oe Ni 


Stage number i 

I 

2 

3 

4 

5 

Ni 

2000 

1288 

829 

534 

349 


C. Merits and Limitations 

Compared to SW, MPW is able to obtain similar accuracy 
at lower computational load ca. However, it is not clear 
that how to optimally select the number m of stages and 
the number Ni of particle windows in each stage. Guadldi 
et al. gives an empirical rule for parameter selection: 

ATi = X (7) 

where A^i is the initial number of particle windows. The 
empirical values of m and 7 are 5 and 0.44, respectively. 
The exponential rule of 0 makes the number Ni decrease 
from stage to stage. Representative values of Ni are given 
in Table 1. 

There are three problems with the MPW algorithm: 

1) Ni has to be large enough so that the Ni particle 
windows to some extent overlap the objects in the 
image. Otherwise, A^ 2 , • • •, N^ particle windows in 
later stages are hard to detect the objects. Extremely, 
if none of the number of particle windows are 
positives, then the subsequent N 2 new particle win¬ 
dows sampled from gi(w) = Ylfh 

will not contain any clue of the location about the 
objects because G{wi, F) is meaningless in this case. 

2) MPW generates too many unnecessary particle win¬ 
dows around the object and object-like regions. Con¬ 
sidering that classification of these regions is more 
time-consuming than obviously non-object regions in 
the algorithm of cascade AdaBoost, generating too 
many particle windows around the object and object¬ 
like regions greatly limits its efficiency. 

3) The rule of parameter selection is not guaranteed to 
be optimal, because there is no reason to support that 
the values in Table 1 are the best. 

IV. Improved Particle Windows (iPW) 

In this section, we propose to improve MPW in order to 
detect the objects in an image with a smaller number of 
particle windows (PWs). 

Firstly, we define several concepts: rejection particle win¬ 
dow, acceptance particle window, and ambiguity particle 
window. Secondly, the concepts of regions of rejection and 
acceptance are introduced. Finally, we will describe iPW 
algorithm based on these concepts and explain why iPW is 
superior to MPW. 

A. Rejection, Acceptance, and Ambiguity Windows 

As stated in Section 3, each particle window represents a 
window v^ = {x^y^s)"^ where x, y and s are the horizontal 
position, the vertical position and the size of the window, 
respectively. The particle window can also be expressed as 
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w(x,^,s). In the following, we use w{x,y) to represent 
w(x,^,5) when s is fixed. 

We employ the classifier response /(w) and its low 
and high thresholds (i.e., ti and th) to define rejection, 
acceptance, and ambiguity particle windows, respectively: 

1) A window w is called Rejection PW (RPW) if 
f{w) < ti. Rejection PW is the particle window 
which can be definitely classified as negative class 
due to its low classifier response. 

2) A window w is called Acceptance PW (APW) if 
/(w) > th. Acceptance PW is the particle window 
which can be safely classified as positive class (ob¬ 
ject) because of its high value of classifier response. 

3) A window w is called Ambiguity PW (ABPW) if 
U < /(w) < th. One cannot classify ambiguity 
PW as positive class because its classifier response 
is not large enough, meanwhile cannot classify it as 
negative class because its classifier response is not 
low enough. And we call the set of the ambiguity 
particle windows W^^- 



Fig. 2. |(a)| The orig inal images. |(b)| Th e ave rage classifier response 

/(wj) of the windows. |(c)| A profile of (b). |(d)| The rejection radius vr 
varies with /(w^). 


B. Regions of Rejection and Acceptance 

Region of Rejection (RoR) If a particle window v^ is 
considered as a rejection PW, it can be used to securely 
reject a set of nearby windows. The centers of the nearby 
windows and w itself are called Region of Rejection 
(RoR) of w. We denote the RoR by 

nij(w) = {a;, 2 /| ||wi - w|| < rij}, (8) 

where the radius vr is the maximum radius 

= max||wi - w||, s.t. f{wi) < ti. rg) 

Vr is a function of the classifier response /(w). In Fig. 
2(d), the smaller /(w) is, the larger vr is. 

All the windows belonging to Ri^(w) are represented 
as Wr{w). 

Assumption 1 (Rejection Assumption). If a window w 
is classified as a rejection window due to /(w) < ti then 
all the windows Wi^(vv') can be directly rejected (i.e., 
labeled as negatives) without the necessity of computing 
the classifier response f{wi)^Wi G Wi^(vv'). 

This assumption is illustrated in Fig. 3(c), the red part 
in the image consists of regions of rejection. 

Region of Acceptance (RoA) If a particle window v^ is 
considered as an acceptance PW, it can be used to securely 
accept a set of nearby windows. The centers of the nearby 
windows and w itself form Region of Accept (RoA) of 
w. Mathematically, RoA can be denoted by R^: 

R-a(w) = {x, y\ I |wi - w| I < ta}, (10) 
where the radius va is the maximum radius 

= max||wj - w||, s.t. > U- (11) 

Wi ^ 


Note that the constant is /(w^) > ti instead of f{wi) > 
th. Obviously, va with /(w^) > ti is larger than va with 
/(Wi) > th. 

All the windows belonging to Ryi(w) are represented 
as Wyi(v^). 

Assumption 2 (Acceptance Assumption). If a window 
w is classified as acceptance window due to /(w) > th 
then all the windows Wa(w) should be directly accepted 
(i.e., labeled as positives) without the necessity of 
computing the classifier response f{wi),Wi G Wa(w). 

In Fig. 3(d), the blue part in the image consists of the 
regions of acceptance. 

Let the center of a window w coincide with the center 
of an object and the size (scale) of the windows match 
that of the object very well (Fig. 2(a)). Then we compute 
the classifier responses of the neighboring windows . As 
shown in Fig. 2(b), it is usual that /(w) is the largest and 
f{wi) decreases with the distance ||w^ — w||. In Fig. 2(c), 
the radius of va is chosen so that it satisfies f{wi) > ti. 
Note that in Fig. 2(d), the rejection radius vr is derived 
from Fig. 2(c). 

C. iPW: Rejection-based Random Sampling 

In this section, we describe iPW based on the concepts 
of RPW, APW, and ABPW, and their corresponding RoR 
and RoA. 

1) Proposal Distribution of iPW: We introduce the mo¬ 
tivation of iPW by firstly analyzing the properties and 
limitations of MPW. 

Rejection Particle Windows and Dented Uniform Dis¬ 
tribution ur{w) As discussed in Section 3.1, the particle 
windows of MPW are sampled from f{wi)G{wi,'E). 
The contribution of a particle windows v^^ is determined 
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(a) 



I 


(b) 




(c) (d) 

Fig. 3. Illustration of R oR and RoA. The orig inal image. |(b)| 
The classifier response. |(c)| Regions of rejection (RoR). |(d)| Regions of 
acceptance (RoA). 


by its weight /(w^). If /(w^) is very small, the 
contributes little to object detection because subsequent 
stage will not sample windows from the corresponding 
distribution /(w^)(7(w^, S). However, if only a small num¬ 
ber of particle windows is generated, most of them will 
have small weights and the regions of objects will be not 
sampled . In this paper, we proposed how to make use of 
these particle windows with small weights (i.e., RPWs) for 
efficient object detection. One of the main contributions of 
the paper is to use rejection particle windows and the 
corresponding Wi^(wi) (i.e., the set of windows insides 
the region Ri^(w^)) to form a dented uniform distribution 
Let the number of RPWs be Nr, then ur{w) is 
expressed as: 

fo, w e {Wfl(wi) U ... U Wfl(wiVH)}, 

ur{w) = <^ 

otherwise, 

( 12 ) 

where ur satisfies fJJ -^dxdyds = 1. Fig. 4(b) illustrates 
an one-dimensional dented uniform distribution. Obviously, 
sampling from this dented uniform distribution avoids 
drawing windows from the existing regions of rejection. 

Acceptance Particle Window and Dented Uniform 
Distribution ua{^) In MPW, when a particle win¬ 
dow (i.e., acceptance particle window) has large 
weight f{wi), it has large contribution to the distribution 
Sampling from this updated distribu¬ 
tion will generate particle windows overlapping or even 
coinciding with . We think that it is redundant to resam¬ 
ple the acceptance window v^^ and its close neighbors. To 
avoid the redundancy, we propose to employ the acceptance 
particle windows to maintain and update a dented uniform 


M(w)i 

k M«(wi 








Wj,(w,) 

W^(W2) 



Wi 





(a) (b) 



(c) (d) 

Fig. 4. Uniform distri bution and dented uniform distribution. |(a)| 
Uniform distribution. |(b)| Dented uniform distribu tion ur(w) formed 
by two rejection particle windows wi and W 2 . |(c)| Dented uniform 
distr ibut ion ua (w) formed by two acceptance particle windows W 3 and 
W4. 1 (d) I Mixture dented unform distribution by combing the rejection and 
acceptance particle windows. 


distribution fi^(w): 

fo, w e {Wa(wi) U ... U W^(wjv^)}, 
«a(w) = <^ 

Otherwise, 

(13) 

where a a satisfies Jff -^dxdyds = 1, and Na is the 
number of acceptance particle windows. Fig. 4(c) illustrates 
a fiA(w). 

Obviously, both the rejection and acceptance particle 
windows play role in excluding regions in uniform distri¬ 
bution. Therefore, as illustrated in Fig. 4(d), we propose to 
combine ur{w) and ua{^) into a unified dented uniform 
distribution u{w) = ur{mv) x fi^(w): 


u{w) 


0, AvGjWi^UWA}, 
otherwise, 


(14) 


where a satisfies Jff ^dxdyds = 1. 

Ambiguity Particle Windows and Dented Gaussian 
Distribution Suppose that there are Nab ambiguity par¬ 
ticle windows in With the class label being ei¬ 

ther positive or negative, the region nearby the ambiguity 
particle window exhibits the largest uncertainty relative to 
the rejection particle windows and the acceptance particle 
windows, so it contains potential clue for objects. One way 
to exploit the Nab ambiguity particle windows is directly 
using them for modeling a mixture of Gaussian distribution 
5(w): 

fi'(w) = L], /(wi)G'(wi, S). (15) 

However, our experimental results show that sampling 
from g{w) may result in particle windows overlapping 
with the existing rejection or acceptance particle windows. 
Clearly, it is useless to sample such particle windows. 
Therefore, to remove the redundancy, we propose to model 
a mixture of dented Gaussian distribution g{w) using 
the Nab ambiguity particle windows with the help of 
the existing rejection and acceptance particle windows or 
equivalently the current dented uniform distribution u{w): 

5(w) = /(wi) [G(wi,S) X (a X fi(w))]. (16) 

^ -^2=1 
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Fig. 5. Dented Gaussian distribution. |(a) [ Gaussian distribution. |(b) [ Dented 
uniform distribution. Mixture of dented Gaussian distribution. 


Because a x {t(w) = 0 in the regions of rejection and 
acceptance and a x u{w) = 1 elsewhere, so multiplying 
G(w^, H) with a x u{w) results in dented Gaussian distri¬ 
bution as is illustrated in Fig. 5. In Fig. 5(a) two ambiguity 
particle windows wi and W 2 in ab form a mixture of 
Gaussian distribution ^(w). In Fig. 5(b) a rejection particle 
window mvr and an acceptance particle window form a 
dented uniform distribution u{mv). Fig. 5(c) shows the final 
dented Gaussian ditribution ^(w). 

Proposal Distribution. The proposal distribution g(w) 
of iPW is a weighted average of the dented uniform distri¬ 
bution fi(w) and mixture of dented Gaussian distribution 
5(w): 

g(w) = P„ X u{w) +PgX g{w) 

= P„ X u{w) 

N.s (17) 

+ PgYl 7S) X (a X {((w))]. 

More generally, the proposal distribution qi{w) in stage 
i is expressed as 

ft(w) = P„{z) X Ui{w) 

+ Pgi^) /(wj) [G(wj, S) X (a X Uiiw))]. 
i=i 

(18) 

The weights Pu and Pg can be regarded as the pos¬ 
terior probabilities for to be generated from fi(w) 
and ^(w), respectively. That is p(ii(v^)|v^^) = Pu and 
= Pg which can be respectively defined by 

N f} N A 

P„ = a X (1- ^ ), and Pg = 1 - P„. (19) 

In Of G [0,1] is used for performance adjusting. Nr = 
|Wi^| and Na = |W^| are the numbers of rejection and 
acceptance particle windows in Wi? and W^, respectively. 

Hypothesis 1. Suppose there are two stages and the 
number of particle windows in stage 1 and stage 2 are 
Ni and N 2 , respectively. In both MPW and iPW, the Ni 
particle windows are generated from the same uniform 
distribution. But the N 2 particle windows in stage 2 of 
MPW are sampled from g'(w) whereas they are sampled 


from g(w) in iPW. Then the probability for N 2 particle 
windows in iPW to contain the object is larger than that in 
MPW. 

It is trivial to prove Hypothesis 1. If there are non-zero 
number of rejection and/or acceptance particle windows, 
then the search region is reduced by these particle win¬ 
dows (equivalently, the dented uniform distribution iii(w)). 
Consequently, sampling the same number of particle win¬ 
dows from reduced search domain is better than from the 
original large domain in the sense of detecting the objects. 
Generally, if both iPW and MPW use the same number 
of particle windows in each stage, then the probability for 
iPW to detect the objects is larger than that of MPW. 

Each Stage Consists of One Particle Window So far, 
we have designed the new proposal distribution of iPW. 
The question is that how many particle windows are to 
be generated in each stage. In MPW, the exponential rule 
of (7) is used for setting the window number. But this is 
far from optimal. Intuitively, we think that it is optimal if 
each stage contains one particle window. However, it fails 
completely for MPW. Throughout the paper, iPW means the 
one where each stage has a single new particle window. 
The number of generated particle windows incrementally 
increases one by one. The first letter ’i’ of ”iPW” is named 
after ’’incremental”. 

2 ) Basic Algorithm of iPW: The core of iPW is itera¬ 
tively sampling particle windows from the proposal distri¬ 
bution g(’w) and updating the sets of rejection, acceptance, 
and ambiguity particle windows. Because the proposal 
distribution q{w) is a weighted average of the dented 
distributions u{w) and g{w), drawing a particle window 
from g(w) is equivalent to drawing from either u{w) or 
^(w) with the probabilities Pu and Pg, respectively. 

The basic algorithm of iPW is given in Algorithm 2. The 
output is Wp (the final set of positive particle windows) on 
which non-maximum-suppression is applied for final object 
detection. 

In the initialization step, the sets of rejection, acceptance, 
and ambiguity particle windows are emptied (line 2). The 
dented uniform distribution u{w) is initialized by the uni¬ 
form distribution u{'w) = 1/N because currently there are 
no rejection and acceptance particle windows (line 3). The 
mixture of dented Gaussian distribution ^(w) is initialized 
to be 0 because so far there are no ambiguity particle 
windows to really construct it (line 3). 

In each iteration, a particle window is generated from 
either u(yv) or g{^) until the predefined number Nipw of 
particle windows is obtained. According to the value /(w) 
of classifier response, the generated particle window w is 
classified as rejection, acceptance, or ambiguity particle 
window. If v^ is classified as rejection particle window, 
then all the windows Wp(v^) in the Rp(v^) (region of 
rejection particle window of v^) will be remerged into 
Wp. If w is classified as acceptance particle window, 
then all the windows Wyi(w) in the R^(w) (region of 
acceptance particle window of w) will be remerged into 
W^. Otherwise it will be put into the window set 
(see line 9, 10 and 11). 
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Algorithm 2 The basic algorithm of iPW. 

Input: 

The number N of all candidate windows; 

The number Nipw of total particle windows; 

High and low classifier thresholds th and tp respec¬ 
tively; 

Output: 

The set Wp of positive particle windows. 

1: Initialization 

2: Empty the sets of rejection, acceptance, and ambi¬ 
guity particle windows: Wp ^ 

and Wab ^ respectively. Let the numbers of 
rejection, acceptance, and ambiguity particle windows 
be Nr = 0, Na = 0, and Nab = 0. 

3: Initialize the dented uniform and dented Gaussian dis¬ 
tributions by fi(w) ^ 1/N and ^(w) ^ 0. 

4: Iteration: 

5: for i = 1 to NiPw do 

6 : Pu = ax(l- Pg = l- Pu. 

7: Sample a particle window w from either fi(w) or 

^(w). The probabilities for w to be generated from 
fi(w) and ^(w) are Pu and Pg, respectively. 

8: Put w into Wp, W^, or W^p according to the 

classifier response: 

9: If f{w) < tu then Wp = Wp U Wr{w), Nr ^ 

|Wp|; 

10: If /(w) > th, then Wa = W^ U Wa(w), Na ^ 

|W^|, and Wp = Wp U w; 

11: If ti < f{w) < th, then Wap = Wap U av, 

A^ap^|Wap|. 

12: Update fi(w) and ^(w) using the updated Wp, Wa, 

and Wap- 

13: end for 
14: return Wp. 


Finally, based on the updated Wp, Wa, and Wap, the 
dented distributions fi(w) and ^(w) are updated according 
to (14) and (16), respectively. 

Characteristics of the iPW algorithm Fig. 6 demon¬ 
strates the characteristics of the iPW algorithm. Because 
each stage (iteration) generates a single particle window, the 
cumulative number Nipw{i) of generated particle windows 
at stage i is Nipw{i) = Among the Nipw{i) particle 
windows, Pu{i) fraction is sampled from fi(w) whereas 
Pg{i) fraction is sampled from ^(w). That is, the number 
of windows coming from u{w) is N^py^{i) = Pu{i) x 
Nipw{i)^ and the number of windows coming from g{^) 
is Nfp^{i) 'Bi Pg{i) X Nipw{i)- Fig. 6(a) shows that most 
of the generated particle windows are from fi(w) in the 
first several stages. But its fraction (i.e., Pu{i)) decreases 
as iteration proceeds meanwhile the fraction Pg{i) increases 
(see Fig. 6(b)). 

The above phenomenon is explained as follows. Because 
the number of object windows is very small relative to 
the total number of windows in the image, drawing a 
particle window from the initial distribution u{y^) and 



-^,(0 

- 


(a) 


(b) 



(c) 


Fig. 6. Charac teris tics of iPW. 
and |(b)| The curves 

Nuii), 


(a) The curves of Nj p w ( 
orPn( 2 ) and 


0 ’ 

The curves of 


the distribution fii(w) in first few stages (e.g., i = 10) 
will result in rejection particle windows in a very large 
probability. Consequently, the number A/^p(i) of rejection 
particle windows increases very fast with i, but the num¬ 
ber A^a(^) of acceptance particle windows increases very 
slowly (see Fig. 6(c)), which makes the number Nu{i) = 
N—NR{i)—NA{i) of unvisited windows be very large for 
small i. According to (19), Pu{i) Pg{i). But as iteration 
proceeds, NR{i) and A^a(^) increase monotonically making 
Pu decrease. 

As NR{i) and A^a(^) increases monotonically with i, 
so it can pay more attention to the other unvisited poten¬ 
tial areas. This characteristic makes iPW not to generate 
unnecessary too many particle windows around the object 
and object-like regions. It is known that classification of 
these regions is very time-consuming if cascade AdaBoost 
is adopted. This is one of the advantages of iPW over MPW. 

3) Semi-incremental Version of iPW (siPW): Algorithm 
2 is a purely incremental algorithm which has some prob¬ 
lems. First, in the first few iterations the number of particle 
windows is very small, so A/'ab(^) in it is much smaller. In 
this case, ^(w) can’t reflect the probability distribution of 
whole image. In order to have enough ambiguity particles 
to represent the probability distribution, the variables N^ 
and b are introduced in Algorithm 3. Through them, the 
particle windows are forcibly sampled from fi(w) in first 
several stages until that a certain number N^ of particle 
windows, especially the ambiguity particle windows, is 
available. By this way, it can better reflect the probability 
of whole image. Second, if a particle window is sampled 
from ^(w) and then ^(w) is immediately updated, the 
latter sampled particle windows will be heavily centered 
on the strongest classifier response regions. Namely, the 
regions with strongest responses will be enhanced more and 
more, while the regions with the relative lower responses 


















SHELL 3;. pang, et al\ LEARNING SAMPLING DISTRIBUTIONS LOR ELLICIENT OBJECT DETECTION 


9 


Algorithm 3 Semi-incremental version of iPW. 

Input: 

The number N of all candidate windows; 

The number Nipw of total particle windows; 

High and low classifier thresholds th and tp respec¬ 
tively; 

The threshold number for the number of ambiguity 
particle windows 

Output: 

The set Wp of positive particle windows. 

I: Initialization 

2 : Empty the sets of rejection, acceptance, and ambi¬ 
guity particle windows: Wp T>,Wa ^ 
and Wab ^ respectively. Let the numbers of 
rejection, acceptance, and ambiguity particle windows 
be Nr = 0, Na = 0, and Nab = 0. 

3: Initialize the dented uniform and dented Gaussian dis¬ 
tributions by fi(w) ^ 1/N and ^(w) 0. 

4: Initialize the binary indicator 6 = 0 and the number of 
cumulative particle windows Nq = 0 . 

5: Iteration: 

6 : for i = 1 to NiPw do 

7: If 6 = 0, then Pu ^ I and Pg ^ 0, else Pu = 

a X (1 - ) and Pg = 1 - Pu. 

8 : Sample a particle window v^ from either u{w) or 

g{w) with Pu and Pg, respectively. Nc = Nc + 1 . 
9: Put w into Wp, Wa, or W^p according to the 

classifier response: 

10 : If /(w) < ti, then Wp = Wp U Wp(w), Nr ^ 

|Wp|; 

II: If f{w) > th, then Wa = W^ U Wa(w), Na <- 

|Wa|, and Wp = Wp U w; 

12 : If ti < f{w) < th, then Wab = W^p U av, 

Nab^\WabI 

13: Update ii(w) using the updated Wp and W^. 

14: If Nc = NX, then 6 = 1 , update q(w), W^p ^ 

N*=NfjX e-^, and Nc = 0. 

15: end for 
16: return Wp. 


where object exists will be ignored. So instead of updating 
^(w) per particle window, it is wise to update ^(w) 
until there is a certain number of particle windows (line 
14). To overcome the above problems, a semi-incremental 
version of iPW (i.e.. Algorithm 3) is proposed. The main 
differences from Algorithm 2 are written in italic. We 
call it semi-incremental algorithm because ^(w) is updated 
once there is a certain number Nf, of pari tide windows, 
though each stage has one particle window and the rejection 
regions are updated in purely incremental manner. 

4) Efficiently Sampling From Dented Uniform and Gaus¬ 
sian Distributions: As can be seen from (1), the compu¬ 
tation time of an object detection algorithm is composed 
of the time of window generation, feature extraction, and 
classification. So it is important for iPW to efficiently 
generating particle windows from fi(w) and ^(w). 

To efficiently draw a particle window from u{w), we. 


Algorithm 4 Draw a particle window w from u{w) (or 

g(w)). _ 

Input: 

The sets of rejection and acceptance particle windows: 
Wp and WA, respectively; 

The maximum iteration number A^max; 

Output: 

a particle window w. 

I: for n = 1 to A^max do 

2 : Draw a window from the uniform distribution 

u{w) (or g{w)). 

3: If w ^ Wij and w ^ W^, then w = w„, and 

break. 

4: end for 
5: return w. 


in Algorithm 4, propose to iteratively draw a window 
from standard uniform distribution u{'w) until it does not 
belong to Wp or Wa- Similarly, to efficiently draw a 
particle window from g{w), in Algorithm 4 we propose to 
iteratively draw a window from standard mixture of Gaus¬ 
sian distribution g{'w) until it does not coincide with the 
elements of Wp and W^. As one can design algorithm for 
checking w G Wp and w G W^ in an extremely efficient 
manner, the computation time of window generation in iPW 
is negligible. The maximum iterations number A^max is used 
for avoiding infinite loops. 

V. Experimental Results 

A. Experimental Setup 

Experiments are carried out on the INRIA pedestrian 
dataset and the MIT-CMU face dataset to compare the 
proposed iPW with MPW and SW. To detect pedestrians in 
INRIA dataset, HOG and SVM fT2\ are used for features 
and classifier, respectively. Haar-like features and cascade 
AdaBoost classifier are employed for detecting faces in 
the MIT-CMU dataset (HI. The source code is publicly 
accessible at http://yanweipang.com/papers 

Intermediate results are also given to show the rationality 
of the assumptions mentioned before. 

B. Results on the INRIA Pedestrian Database. 

In the INRIA dataset, the positive training set consists 
of 1208 normalized pedestrian windows, and the negative 
training set contains a mass of windows sampled from 
1218 big and non-pedestrian images. The image size of the 
training window is 128 x 64 pixels, from which a 3780- 
dimensioned HOG feature vector is extracted. A linear 
SVM classifier /(w) is obtained from the training sets. 

As can be seen from Algorithms 2 and 3, the explicit 
parameters of iPW are f, th, Nfj, a and 7 . In our 
experiments, ti = —2.0 and th = 0 are used. Because 
rejection and acceptance particle windows are defined by 
not only ti and th, but also vr and r^. So the parameters 
also include vr and va- In ( 8 ) and (10), the regions of 
rejection and acceptance are circular and isotropic whose 
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TABLE III 

Detection rates vary with the number N oe particle 
WINDOWS WHEN FPPI = 0.1. 


N 

2367 

7100 

II833 

16567 

21300 

26033 

MPW 

0.469 

0.594 

0.614 

0.623 

0.625 

0.627 

iPW 

0.561 

0.620 

0.625 

0.627 

0.628 

0.630 


size are determined by vr and respectively. However, 
because the height h of the pedestrian is larger than its 
width w, it is more reasonable that the region of rejection 
and acceptance is rectangle. The size of the rectangle is 
represented as x r^. Likewise, the size of region of 
acceptance is represented as rj x r^. As stated in Section 
4.2, and depend on the classifier response /(w). In 
our experiments, and are quantized to 9 intervals 
according to the value of /(w). and also depend 
on the object width w and height h in question. Solid 
experiments are conducted to find rule for setting and 
according to /(w), h and w. Table 2 shows how to 
choose and r^. Note that h = 12S and w = 64. In 
Algorithm 3, we only use and when /(w) belongs 
to the first four intervals, r^/w and r^/h are set to 0.16 
and 0.16, respectively. = O.bNiPw is employed in the 
initialization step of Algorithm 3. The parameters a and 7 
are set 0.2 and 0.7, respectively. 

It is noted that regions of rejection and acceptance 
are cubic when scale factor is considered. The testing 
image is zoomed out by a factor 1/1.05. If a window is 
rejected at current scale s, which belongs to the interval 
^Interval, then the windows in adjacent scales s' from 
s X to 5 / 1 . 05 ^“^^’^*®’^^“^ with the size 

0.8^r^ X O.S'^r^ (A = llog^^Q^gj) are also rejected. If 
a window is accepted at current scale s, then the windows 
in adjacent scales s' from s x 1.05^ to s/1.05^ with the 
size 0.8^r^ x 0.8^r^ (A = | logJ^Q^g |) are also accepted. 

Table 3 compares iPW with MPW in terms of detection 
rate when they generate and examine the same number 
N of particle windows. When N = 2367, the detection 
rate of iPW is 0.561 whereas the detection rate of MPW 
is 0.469, meaning that the detection rate of iPW is 9.2% 
higher than that of MPW. As N decreases, the advantage 
of iPW becomes more remarkable. 

To further see the advantage of iPW, we show in Fig. 
7 a specific detection result of iPW and MPW when N 
is as small as 500. The red dots in Fig. 7(a) indicate the 
centers of particle windows in the last stage of MPW. 
Fig. 7(b) gives the final detection result by non-maximum 
suppression, where the man is detected whereas the woman 
is missed. The ambiguity particle windows of the last stage 
of iPW are shown in Fig. 7(c) and the final detection 
result is shown in Fig. 7(d). On the one hand. Fig. 7 
demonstrates that MPW fails to detect the woman when 
a small number of particle windows is sampled whereas 
iPW is capable to localize both the woman and man. iPW 
generates the particle windows one by one. If the generated 
particle window has a lower classifier response, then the 
particle window (i.e., rejection particle window) will tell 



(a) (b) (c) (d) 


Fig. 7. The detection results of iPW and MPW under 50 0 pa rticle 
windows. |(a)| The particle win dows of the last stage in MPW. [(b)] Final 
detec tion result of MPW. |(c)| The particle windows of the last stage in 
iPW. |(d)| Final detection result of iPW. 



(a) (b) (c) (d) 



(e) (f) (g) (h) 


Fig. 8. MPW generates unnecessary too many particle windows around 
the pedestrian regions, (a), (b) and (c) are the updating process of particle 
windows in MPW. (d) is the final detection result of MPW. (e), (f) and 
(g) are the updating process of particle windows in iPW. (h) is the final 
detection result of iPW. 

the next particle window not to sample from it and its 
neighboring region (i.e., region of rejection). As a result, 
other regions including the object regions will have larger 
possibility to be investigated. By contrast, MPW does not 
have the rejection mechanism. When the current particle 
windows do not contain clues of the objects, it is almost 
impossible for the latter particle windows to capture the 
location information of the objects. 

On the other hand, comparing Fig. 7(a) and (c), one can 
observe that MPW generates unnecessary too many particle 
windows around the man, whereas iPW can properly assign 
the limited number of particle windows to both the man 
region and the woman region. This phenomenon can be 
more clearly seen form Fig. 8. Fig. 8(a)-(c) give the updat¬ 
ing process of particle windows in MPW, Fig. 8(d) shows 
that MPW is able to detect two persons if there is enough 
number of particle windows in the initialization. Fig. 8(e)- 
(g) give the updating process of particle windows in iPW, 
Fig. 8(h) shows that iPW detect even four pedestrians, 
including a false positive. 

Fig. 9 shows how Nf^{i) = |Wfl| + \Wa\ and 
Np^{i) = N — |Wii| — |Wyi| vary with the number 
i of generated particle windows of iPW algorithm and how 
N^PW [pj = i and N^p^ {%) = N—i vaiy with i of MPW 
algorithm. One can see that the number N^p^ (i) of visited 
windows of iPW grows much faster than that of MPW. 
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TABLE II 

Set and according to /(w), h and w. 


Ft Interval 

0 

I 

2 

3 

4 

5 

6 

7 

8 

/(w) 

[-inf, -4.0] 

[-4.0, -3.5] 

[-3.5, -3.0] 

[-3.0, -2.5] 

[-2.5, -2.0] 

[-2.0, -1.5] 

[-1.5, -I.O] 

[-1.0, -0.5] 

[-0.5, 0.0] 

r^/w 

0.22 

0.18 

0.16 

0.12 

O.IO 

0.06 

0.06 

0.02 

0.02 

rl/h 

0.22 

0.18 

0.16 

0.12 

O.IO 

0.06 

0.06 

0.02 

0.02 



Fig. 9. The variations of and . 

Equivalently, the number (i) of unvisited windows of 

iPW drops much faster than that of MPW. So generating 
the same number of particle windows, iPW can classify 
(reject or accept) more windows (regions) than MPW. This 
explains why iPW obtains better detection accuracy than 
MPW when they use the same number of particle windows. 

If a smaller number of particle windows is generated, can 
iPW achieve the same detection accuracy as MPW? If it is 
true, then one can conclude that iPW is more efficient than 
MPW. To answer this question, SW is used as a baseline. 
It scans the image with the pixel stride 8 and scaling factor 
1.05, and the number of scanned windows is denoted by 
Nsw- varies with the size of testing image. For 

a 480 X 640 image, Nsw is 47335. Let MPW generate 
Nmpw = 0.3 X Nsw particle windows and iPW generate 
Nipw = 0 . 15 X Nsw particle windows. The resulting 
curves of miss rate vs. FFPI are plotted in Fig. 10. It is seen 
that these different window generation algorithms have very 
close operating points. For example, the miss rates of SW, 
MPW and iPW are 23.4%, 22.8%, and 23.4%, respectively 
when FFPI=1. At these operating points, the average values 
of Nsw^ Nmpw and NiPw in INRIA are shown in 
Table 4. Table 4 shows that to achieve the same operating 
point SW has to investigate 47335 windows whereas it is 
enough for iPW to generate and check 7099 windows. The 
detection time Tsw of SW is 3.94 times of that (i.e., Tipw) 
of iPW. Moreover, Nipw /Nmpw = 0.499 means that 
using half of particle windows iPW can obtain the same 
detection rate as MPW. The ratio of detection time Tmpw 
of MPW and detection time TiPw of iPW is 1.8, implying 
much higher efficiency of iPW than MPW. 

C. Results on the MIT-CMU Face Database. 

In Section 5.2, the feature and classifier are HOG and 
SVM, respectively. In this section, we evaluate iPW by 
using Haar-like features and cascade AdaBoost classifier 



Fig. 10. DET curves comparing iPW with MPW and SW on INRIA. 


for detecting faces in the standard MIT-CMU face database. 
The testing set consists of 125 images with 483 frontal 
faces. A 10 layers cascade model is learnt from 20000 
normalized 20 x 20 small face images and 5000 non-face 
large negative images. 

Because the range of response of cascade AdaBoost 
is quite different from SVM, the low and high classifier 
thresholds ti and are also different from those in Section 
5.2. Specifically, ti = 0.2 and th = 1.0 are adopted. 
Consequently, the length rp and va of regions of rejection 
and acceptance should be tuned, rp and va are related 
to /(w), h and w. But the detection window is square, so 
h = w. The classifier response f{w) of a. cascade AdaBoost 
is defined by f{w) = j^/L, where jw is the index j 
of the last stage which provides a positive classification 
for w, and L = 10 is total number of the stages of the 
cascade structure. The relationship between rp, /(w) and 
h is given in Table 5 where rp/h monotonously decreases 
with /(w). In Alogorithm 3, we only use the rp when 
/(w) belongs to the first two values, r^/w and r/^/h are set 
to 0.1 and 0 . 1 , respectively. N^ = O.bNiPw is employed 
in the initialization step of Algorithm 3. The parameters a 
and 7 are set 0.2 and 0.7, respectively. 

Similar to Section 5.2, regions of rejection and accep¬ 
tance are cubic. The testing image is zoomed out by a 
factor 1/1.15. If a window is rejected at current scale s, the 
windows in neighboring scales s' from s x 1.15 to s/1.15 
with the size 0.5^r^ x 0.5^r^ (A = | logJ ^/5 |) form the 
Wi? of rejection particle windows. If a window is accepted 
at current scale s, then the windows in adjacent scales s' 
from s X 1.15 to s/1.15 with the size 0.5^r^ x 0.5^r^ 
(A = I log ^^/5 I) are also accepted. 

With the above parameters, iPW is applied to 125 test¬ 
ing images and detection rates corresponding to different 
number N of particle windows are shown in Table 6 . Table 
6 also gives the detection rates of MPW. It is observed that 
iPW has almost higher detection rate in each case. When 
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TABLE IV 

Efficiency of SW, MPW and iPW on INRIA. 


Nsw 

Nmpw 

Nipw 

NiPw /Nmpw 

Tmpw/T iPw 

Tsw/T mpw 

Tsw/TiPw 

47335 

14200 

7099 

0.49 

1.80 

2.19 

3.94 


TABLE V 

Set rji according to /(w) and h. 


/(w) 

0.0 

0.1 

0.2 

0.3 

0.4 

0.5 

0.6 

0.7 

0.8 

0.9 

rp/h 

0.100 

0.090 

0.060 

0.050 

0.050 

0.040 

0.040 

0.030 

0.040 

0.030 


TABLE VI 

Detection rates vary with the number N oe particle 
WINDOWS WHEN FPPI = 0.1. 


N 

25066 

75200 

125333 

175466 

225600 

275733 

MPW 

0.583 

0.758 

0.788 

0.806 

0.811 

0.812 

iPW 

0.676 

0.792 

0.806 

0.813 

0.816 

0.819 



(a) (b) (c) (d) 


Fig. 11. Th e detection results of iPW and MPW under 500 0 pa rticle 
windows. |(a)| The particle win dows of the last stage in MPW. Final 
detec tion result of MPW. |(c)| The particle windows of the last stage in 
iPW. |(d)| Final detection result of iPW. 


the number of particle windows is small, the advantage of 
iPW is more remarkable. For example, when N = 25066, 
the detection rate of iPW is 9.3% higher than that of MPW. 

Fig. 11(b) and (d) respectively show the detection results 
of MPW and iPW when the number of particle windows 
is limited to 5000. Clearly, iPW is capable of detecting 
the two faces in the testing image whereas MPW does not 
detect any face at all. Fig. 11(a) and (c) show the centers 
of the particle windows generated in the last stage of MPW 
and iPW, respectively. 

Fig. 12(a)-(d) show that when the number of particle 
windows of MPW is upto 20000, MPW can detect the two 
faces in the testing image. But MPW assigns too many 
particle windows around object and object-like regions. The 
iterations of iPW are shown in Fig. 12(e)-(h) which assign 
proper number of particle windows around object regions. 
The intuition is that if there are a few acceptance particle 
windows around the object, then it is no longer necessary 
to generate additional particle windows around the object. 
Instead, the opportunity should be given to check other 
region. 

Finally, experiments are conducted to see whether iPW 
can achieve comparable face detection results as MPW 
if a smaller number of particle windows is used. As 
in pedestrian detection experiments, SW is also used as 
baseline. It slides the testing image with pixel stride 2 
and scale factor 1.25. As a result, SW generates 621816 




(a) (b) (c) 





(e) (f) (g) (h) 


Fig. 12. (a), (b) and (c) are the updating process of particle windows 

in MPW. (d) is the final detection result of MPW. (e), (f) and (g) are the 
updating process of particle windows in iPW. (h) is the final detection 
result of iPW. 



Fig. 13. DET curves comparing iPW with MPW and SW on MIT-CMU. 


and 3941097 windows for 454 x 628 and 1024 x 1280 
images, respectively. Limit the numbers of particle win¬ 
dows in MPW and iPW to Nmpw = 0.25 x Nsw and 
Nipw = 0-13 X Nsw^ respectively. The resulting ROC 
curves of SW, MPW and iPW are shown in Fig. 13. It is 
seen from Fig. 13 that SW is almost consistently inferior to 
both MPW and iPW. Even only 0.13 fraction of windows 
are used, iPW can obtain higher detection rates than SW. 
Moreover, one can see Fig. 13 that the miss rates of iPW 
are almost identical to that of MPW. 

Table 7 shows the testing time Tsw of SW is 1.15 times 
and 1.73 times of MPW and iPW, respectively, when the 
number of generated windows of SW, MPW and iPW are 
501334, 125333 and 65173. The miss rates of the three 
algorithms correspond to the point in Fig. 13 with FPPI=1. 
The speedup effect in face detection is not as significant as 
pedestrian detection. The reason is that the number of HOG 
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TABLE VII 

Efficiency of SW, MPW and iPW on MIT-CMU. 


Nsw 

Nmpw 

Nipw 

NiPw /Nmpw 

Tmpw iTiPw 

Tsw/T mpw 

Tsw/TiPw 

501334 

125333 

65173 

0.520 

1.50 

1.15 

1.73 


features is fixed in pedestrian detection but the number 
of Haar-like features varies with the response of cascade 
classifier. 

VI. Conclusion 

In this paper, we have proposed how to improve MPW. 
The proposal distribution of MPW mainly relies on the 
regions of support. In contrast, the proposed iPW and 
siPW algorithms construct the proposal distribution based 
on the proposed concepts of rejection, acceptance, and 
ambiguity particle windows which are defined by low 
and high thresholds of the classifier response. Both the 
rejection and acceptance particle windows are used for 
reducing the search space. The existence of the objects 
is refiected in the acceptance particle windows and the 
main clue of object locations is contained in ambiguity 
particle windows. Specifically, our proposal distribution is a 
weighted average of dented uniform distribution and dented 
Gaussian distribution which are dented by the rejection and 
acceptance particle windows. An important characteristic of 
the proposed algorithms is that single particle windows is 
generated in each stage, which makes iPW to run in an 
incremental manner. Experimental results have shown that 
iPW is about two times efficient than MPW. 
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